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We provide a simple, unified approach to describing the impact of super-sample covariance on power spec- 
\ trum estimation in a finite-volume survey. For a wide range of survey volumes, the sample variance that arises 

from modes that are larger than the survey dominates the covariance of power spectrum estimators for modes 
c 2 ^ ' much smaller than the survey. The perturbative and deeply nonlinear versions of this effect are known as beat 

' coupling and halo sample variance respectively. We show that they are unified by the matter trispectrum of 

squeezed configurations and that such configurations obey a consistency relation which relates them to the re- 
^ ' sponse of the power spectrum to a change in the background density. Our method also applies to statistics that 

' are based on radial projections of the density field such as weak lensing shear. While we use the halo model 

for an analytic description to expose the nature of the effect, the consistency description enables an accurate 
calibration of the full effect directly from simulations. It also suggests that super-sample covariance may be 
viewed as an additional interesting signal rather than excess noise. 
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I. INTRODUCTION 



Cosmic acceleration is perhaps the most tantalizing problem in modern cosmology. To unlock its mysteries, a number of 
ambitious wide-field optical and infrared galaxy surveys have been proposed. These range from ground-based imaging and 
spectroscopic surveys such as the Subaru Hyper Suprime-Cam (HSC) Survey [1], the Dark Energy Survey (DES) [2], the 
Kilo-Degrees Survey (KIDS) [3], the LSST [4], the Baryon Oscillation Spectrograph Survey (BOSS) [5], the Extended BOSS 
^ ■ (eBOSS) [6], the BigBOSS [7], and the Subaru Prime Focus Spectrograph (PES) Survey [8] [see also 9] to space-based optical 
'— and near-infrared missions such as the Euclid project [10] and the WEIRST project [11]. Each of these surveys approaches 
^ . the nature of cosmic acceleration using multiple large-scale structure probes: e.g. weak gravitational lensing, baryon acoustic 
' oscillations, redshift-space distortion and cluster-counting based cosmology [see Ref. 12, for a recent review]. 
. To attain the full potential of such surveys, it is important to understand the statistical properties of large-scale structure probes 
OS ' and the matter density field that underlies them. Its two-point coiTelation function or the Fourier-transformed counterpart, the 
0^ , power spectrum, is the simplest and most commonly used statistical quantity to extract cosmological information from the 
' large-scale structure probes. The statistical precision of power spectrum measurements is determined by their covariance matrix 
fSj \ that itself contains two contributions; the measurement noise and sample variances caused by the incomplete sampUng of the 
' fluctuations due to a finite-volume survey. 

\ Even though the initial density field is nearly Gaussian, the sample variance of large-scale structure probes gets substantial 
[ ' non-Gaussian contributions from the nonlinear evolution of large-scale structure [13-15]. Most of the useful cosmological 
I> ' information in such probes lies on the weakly or deeply nonlinear regime in the matter distribution. In the nonlinear regime, 
the different Fourier modes are no longer independent and correlated with each other. Hence, the off-diagonal correlations in 
the power spectrum sample variance dominate and are described by the connected 4-point correlation function of the matter 
^ distribution, the matter trispectrum. To realize the statistical power carried by power spectrum, it is important to accurately 
model the matter trispectrum in the nonlinear regime. The standard methods to study the power spectrum sample variance 
are based on simulations of the A-dominated cold dark matter (ACDM) model [13, 14, 16-24] which can be used to test and 
calibrate semi-analytic descriptions such as the halo model approach that can encompass a larger range of model parameters 
[15,22, 25-27]. 

One further complication arises. Nonlinear evolution couples short-wavelength modes relevant for the power spectrum mea- 
surement with the very long-wavelength modes outside a survey volume, the so-called super-survey modes. The super-survey 
modes are tricky to consider, because their effect vanishes for power spectrum measurements of simulations with periodic 
boundary conditions that have no contribution of modes outside the simulation box. Super-sample variance was originally 
pointed out in Ref. [28] for the number counts of halos and Ref. [29] for the power spectrum, which were followed by many 
studies [16, 17, 19, 20, 22, 25, 27, 30, 31]. These studies have shown that super-sample variance dominates the non-Gaussian 
errors in the weakly or deeply nonlinear regime. The effect in the weakly nonlinear regime arises from mode coupling predicted 
by perturbation theory [32], which is called the beat-coupling effect [16]. The effect in the deeply nonlinear regime arises via 
the halo bias theory [33], which is called the halo sample variance [19, 22, 25] [see also 28]. Super-sample variance contributes 
to a saturation in the information content carried by the power spectrum amplitude [16, 19, 21, 22, 34] [see also 35, for tests on 
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real data]. The saturated information content has triggered a further discussion on how to recover the information content that 
was originally contained in the initial Gaussian field, by using the nonlinear transformation method [e.g. 36, 37] or including the 
higher-order moment information [22, 38]. 

However, it is still unclear how all of these super-sample variance eff'ects are described by the matter trispectrum and how they 
can best be quantified and treated in parameter estimation. In this paper, we provide a systematic study of common origin of all 
such effects, unify their description, and show how they can be directly quantified. 

The structure of this paper is as follows. In § II, we develop a simple, unified approach to describe the super-sample covariance 
of power spectrum estimation in a finite-volume survey and show its relation to the response of the power spectrum to a change 
in the background density. In § III, we use the halo model to compute the power spectrum covariance for a ACDM model and 
verify that we can recover the beat-coupling and halo sample variance results from our description. § IV is devoted to discussion. 
We provide an Appendix that extends the formalism to projected density field statistics such as weak lensing shear. 



II. POWER SPECTRUM COVARIANCE IN A SURVEY WINDOW 



In §11 A we review the construction of power spectrum estimators in a finite-volume survey and discuss their relation to the 
underlying true power spectrum. We relate the covariance of these estimators to the matter trispectrum in §11 B and identify 
where the additional sample variance from modes larger than the survey arises. In §11 C, we present a trispectrum consistency 
relation that straightforwardly relates this effect to the response of the power spectrum to a change in the background density 
and outline how it can be easily caUbrated in simulations. As such, it may also be considered as signal rather than noise, carried 
by a single additional parameter, the variable background density. 



A. Power spectrum estimator 

Assume we measure the underlying density fluctuation field 6{x) through a survey window function W{x) which is 1 in the 
measured region and in the unmeasured region. Note that by construction we implicitly assume that the observable is the 
density fluctuation itself or equivalently the true mean density is known. This is appropriate for statistics related to weak lensing 
that probes the differential gravitational field, i.e. the tidal field. If the density fluctuation field is measured relative to the mean 
of the survey region there is a correction factor that we will describe in §11 C (see also Ref. [31]). 

Regardless of the complexity of the survey geometry, we can always define the observed field as 



6w{x) = (5(x)iy(x), (1) 



whose Fourier transform is a convolution 



Mk)= -_^W(q)5(k-q), (2) 

where T^(k) is the Fourier transform of the survey window function. We have here employed the continuous limit of discrete 
Fourier transforms under the approximation that the total volume for the Fourier transform is much greater than the survey region 
(see Ref. [22, 25] for a pedagogical derivation of power spectrum estimator and the covariance based on the discrete Fourier 
decomposition). 

Let us next define an estimator of the power spectrum as 

P{kd = — I —6wik)6w(-k), (3) 



yw J\k\ek^ n-i 



where the integral is over a shell in ^-space of width and volume Vk. - AnkfAk for Ak/k, <?c 1. Here the effective survey 
volume is defined as 



Given the definition of the power spectrum 



yw = J d^^ W(x). (4) 
<5(k)5(k')) = (2;r)34(k + k')P{k), (5) 

the ensemble average of its estimator is 

(Pikd) = — -TT 7^ \mq)\ i^(k - q). (6) 

Vw J\k\ek, Vk, J {2ny 
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Thus, the observed power spectrum is given as a convolution of the underlying power spectrum with the window function 
combining the density modes separated by less than the Fourier width of the window. In the general case, one would deconvolve 
the window in constructing an unbiased estimator [39, 40]. 

In this paper, we are interested in the effect of the global survey geometry on the power spectrum covariance, not in the effect 
of masked regions at small spatial scales. In this case, the window function has a width of ~ 1/L in Fourier space, where 
L ~ v]^^ or more generally its smallest dimension. When we focus on wavenumber modes satisfying » 1/L, we find that the 
power spectrum estimator of Eq. (3) is unbiased: 

Vw J\\,\ek, Vk, J (27iy Vw J {2Tiy 

Here we have used that P(|k - q|) ^ P{k) over the integration range of c/^q which the window function supports and also assumed 
that P{k) is not a rapidly varying function within the k-b\n. In the third equality on the rh.s., we have used the general identity 
for the window function: 

(27r)^^4(qi..,,), (8) 
where qi...„ = qi + . . . q„ here and below. For n -2,Vw - J \W{q)\^d^q/(27Ty . 



B. Power spectrum covariance 



Now consider the power spectrum covariance, which can be defined in terms of the power spectrum estimator as 

Qj = Coy[P(ki), P{kj)] = {P(kdP(kj)) - (Pikd) {P{kj)) . 



(9) 



Here we consider sample covariance only. A real measurement will have measurement noise covariance, but we do not consider 
its efifect in this paper. In the same A: » 1 /L limit, the covariance becomes 



Cu - 
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^^2P{kif6fj + T'^(ki,kj) 



(10) 



where df: is the Kronecker delta function; df, = I if kj = A:, to within the bin width, otherwise df: = 0. The second term, 

u _ 'J ^ u 

proportional to T^ik,, kj), is the non-Gaussian contribution arising from the connected 4 point function or trispectrum. 



<§(ki)5(k2)5(k3)§(k4)>, = (27r)34(ki234)r(ki,k2,k3,k4), 



convolved with the survey window function: 

-'Ikle*, 



Vw J|k|e*, n-i JlkfeA-, n, J {27iy 



(2;r)-'5^(qi234)7'(k + qi,-k + q2,k' + q3,-k' +q4). 



(11) 



(12) 



The convolution with the window function means that different 4-point configurations separated by less than the Fourier width 
of the window function and involving contributions from super-survey modes contribute in principle. In deriving Eq. (10), we 
have again used the general window function identity Eq. (8) with « = 4 to eliminate one factor of 1 IVw in the power spectrum 
term under the same approximation as in Eq. (7). If the same slowly-varying approximation were true of the trispectrum, it can 
be taken out of the integral and we would obtain the standard result [13] 



Ci. 



___2P(^,.)5,. + C,., 



where 



c™=±r ^r(k,-k,k',-k'), 

Vw J|k|eA-,- Vk: J\k'\ekj Vkj 



(13) 



(14) 



SO that the whole covariance scales as 1 /Vw- Furthermore the first term can be understood from Gaussian statistics: the number 
of independent A:-modes in the shell is given as 



VkVw 27Tk^AkVn 



2(27r)3 



(271)3 



(15) 
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such that 

The factor 2 in the denominator of A^mode arises from the reaUty condition of the density field, i.e. 5^(k) - dw{-ii)- The 
difference in bin width scaling between the Gaussian and non-Gaussian terms means that the latter becomes more prominent 
in the variance when averaging over wide bins or equivalently when considering the covariance of narrow bins. 

For the non-Gaussian term, there are additional effects if the trispectrum at » l/L has structure on the scale of the survey 
Ak < l/L. We next consider the general origin and description of such a term. 



C. Super-sample covariance and trispectrum consistency 

The trispectrum term that governs the additional effects are so-called squeezed quadrilaterals where two pairs of sides are 
nearly equal and opposite. In Eq. (12), we can make the change of variables k + qi <-» k and qi + q2 qi2 under the delta 
function condition qi234 = and the approximation that qn <K k, k'. The term of interest therefore is 

lim T(k, -k + qi2, k', -k' - qi2). (17) 

In this limit, the 4 point configuration describes the connection between P{k) and P{k') through a shared infinite wavelength 
mode qi2. This mode acts like a background density or constant mode to the short wavelengths k and k'. It follows therefore 
that the squeezed trispectrum can be characterized by the response of P{k) to a fluctuation in the background density 5h through 

8P(k)dP(k') 

f (k, -k + qi2, k', -k' - q,2) ^ r(k, -k, k', -k') + I ' \ ' pHqn). (18) 

The overbar here refers to an angle average over the direction of qn since any directional dependence cannot be quantified by a 
purely constant mode. Here P^ is the linear power spectrum and is designated as such to remind the reader that for this relation 
to be applicable 6b must be a mode in the linear regime. On the other hand, there is no such restriction on the fc-modes of Pik). 

In terms of the power spectrum covariance, this relation has the direct interpretation that the measured power in the k and k' 
bins are correlated by the underlying background fluctuation 6b that they share. Trispectrum consistency then implies that the 
covariance is 

-.G , ^TO , ,^,2 dP{kddP{kj ) 
d6b d6b 



Cii = C,.) + C,-^. + — (19) 



where we have introduced the variance of the background density field 6b in the survey window, defined as 



{a^^Y = — \ —^\W{qrP^{q). (20) 



In reducing Eq. (12) with the consistency trispectrum Eq. (18) to Eq. (19), we used the following identity for the window function 
from the convolution theorem: 

/ ~ ^''^ " / = / ^'xW(x)e'''i = 1^(q). (21) 

We call the 6h type covariance term the super-sample covariance of the power spectrum estimator Note that because the back- 
ground correlates changes in P{k) for all k, it can appear as the dominant contribution to the (co)variance when the measurement 
involves a large number of A:-modes. It scales with the volume of the survey only through (o"^)"^ whereas the other terms scale 
like white noise Vy^ . Until the survey becomes much larger than the turnover of matter power spectrum on the horizon scale of 
matter-radiation equality, its relative contribution remains important [22, 28] (see also §111 B). 

As in the case of super-survey mode effects for number counts [41], their impact on the power spectrum should perhaps not 
be considered as excess noise but rather a new signal; the fractional power spectrum response is a template that changes the 
measured power spectrum coherently across bins as 

m^m(l.^.| (22) 

The uncertainty is carried by a single unknown parameter 6b drawn from a Gaussian of variance (cr^)^. Use of the covariance 
formalism in data analysis would premarginalize over this parameter, but it may alternately be a fitting parameter for the power 
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spectrum that has a prior given by cr^. This point of view may also be more appropriate for surveys that are not volume limited, 
i.e. where the window is not strictly or 1. Loss of information on other cosmological parameters of interest that also change 
the power spectrum only comes through degeneracies with this single mode [see also 25, 27, for the similar discussion]. 

It is also easy in this language to account for the alternate definition of the density fluctuation field as relative to the mean of 
the survey region. In this case the observed power would be re scaled as Pw{k) - P{k)l{\ + Sbf and the trispectrum consistency 
would take the same form as Eq. (18) and (19) but with the response to the background mode altered to be 

dP{k) dPw(k) dP(k) 

OOh OOb 00b 

which generalizes the treatment in Ref. [31]. Given that the fractional response function dlnP(k)/d6h is typically positive and 
order unity, the response in Pw{k) can be significantly reduced compared with P{k). Note again that if the observable of interest 
does not depend on the mean density, which is the case for weak lensing shear that probes the differential gravitational field 
(tidal field), P{k) rather than Pw{k) is the relevant quantity. 

The fractional response function din P(k)/d6b is a quantity that can be directly calculated in simulations since introducing 
dh is equivalent to simulating an FRW universe with different background densities [42-A4]. For example, in principle for 
its evaluation at z = only two simulations would be required to calibrate all such effects deep into the nonlinear regime. 
Nonetheless, we find it illustrative to highlight the main features of power spectrum super-sample covariance, as well as derive 
Eq. (19) expUcitly, analytically through the halo model. This will also allow us to make contact with the existing literature on this 
effect, namely "beat coupling" (BC) and "halo sample variance" (HSV) for its weakly and fully nonlinear incarnations. There 
are also terms related converting P{k) to observables related to changes in the distance-redshift relation if the background mode 
encompasses the entire volume out to the observer These in principle can affect the interpretation of baryon acoustic oscillation 
features (see [45] for related sub-survey effects). We leave a more precise calibration and observational considerations to future 
work. 

To be comprehensive, in the Appendix we also develop the analogous formulae to describe the power spectrum covariance for 
a two-dimensional, density field that is obtained by projecting the three-dimensional field, weighted with a selection function, 
along the line-of-sight. The formulation developed in this paper can be straightforwardly extended to covariance theory for 
higher-order correlation functions [see 22, for the attempt to model the BC effect on the lensing bispectrum covariance], and the 
super-sample variance effects should be similarly described by the response of the higher-order correlation to a change in the 
background density. 



III. HALO MODEL APPROACH 



We have seen in the previous section that the power spectrum estimator covariance in the survey window is generally de- 
scribed by the matter trispectrum and the specific effects of super-sample covariance by its squeezed configurations. The matter 
trispectrum itself can be approximated in the halo model by considering correlations induced between dark matter halos via 
perturbation theory and within halos via the universal density profile [15, 46]. Although the halo model is an empirical model 
to describe the nonlinear clustering, it gives a fairly accurate prediction compared to the simulations, e.g. to within a 10-20% 
accuracy in the power spectrum amplitude at scales of interest [22]. Evaluating the covariance using the halo model can therefore 
illustrate the general consistency construction that the additional super-sample covariance terms are equivalent to the response 
of the power spectrum to a background density mode. We develop the halo model formalism in §111 A and illustrate it in the 
ACDM context in §111 B. 



A. Formalism 



In the halo model [46^9], the power spectrum itself is described as 

P{k)^4(k,k) + [ll(k)fpHk\ (24) 

where the first term involves two points correlated by being in the same halo and the second two points in separate halos that are 
themselves correlated by the linear power spectrum. We use the general notation [15] 

« r dn ImY 

I'f,(ki,k2,...,kf,) s JdM—l—j bfiUM(ki)uMik2)---UM(kf,), (25) 

where M is the halo mass, dn/dM is the halo mass function, = I, bi - b{M) is the halo bias, and UM{k) is the Fourier 
transform of the halo density profile normalized so that mm(0) - 1. We have here assumed linear halo bias in that bp - Q for 
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P > 2 [see 15, for a possible extension of the halo model to including the nonlinear halo bias]. This approximation does not 
affect the main results of this paper Note that for a halo bias that satisfies the peak-background consistency relation 



r dn M\ 
\dM \ — \b{M) = 1, 



(26) 



/}(0) = 1, the 2 halo term as ^ — > is simply the linear power spectrum. Furthermore the same peak-background consistency of 
the bias [33] says that 



86,- 



(27) 



which will be useful in relating the power spectrum covariance to its response to a background mode 5;, (see §11 C). 

Likewise, the halo model approach tells us that the matter trispectrum arises from contributions involving one to four halos: 



(28) 



where T^^ , ■ • ■ , T'^^ denote the 1-, 2-, 3- and 4-halo terms that arise from correlations between four points that reside in the 1 
halo (the same halo) and from 2 to 4 different halos, respectively. Using the notations defined in Ref. [15], the different halo 
terms are given as 

r'''(ki,k2,k3,k4) = /^(^l,^2,^3,^4), 

r^^'(ki, k2, k3, k4) = P^{ki2)l\{kuk2)l\{h, k^) + 2 perm., 



rf^'(ki, k2, k3, k4) = P^{ki)l\{kx)l\{k2, k^, k^) + 3 perm., 

r^^ki, k2, k3, k4) = B"(ki , k2, k34)/|('ti)/|('t2)/2(''^3, ^^4) + 5 perm., 



r^'xki , k2, k3, k4) = r^'(ki , k2, k3, k4)/;(^i)/;(^2)/;(^3)/;(^4). 



-PT 



(29) 



The 2-halo term has two contributions, T'^'^ and where one or two point(s) among the four points are in the first halo, with the 
remaining points in the second halo. Here EP- and are the matter bispectrum and trispectrum given based on perturbation 
theory [32] 

B"(ki,k2,k3) = 2F2(ki,k2)P^(fci)P^(fc2) +2perm., 
rPT(ki,k2,k3,k4) = 4 [F2(ki3, -•ki)F2{\i,MPHhi)PHh)PHk2) + llperm.j 

+6 [F3(ki, k2, ^i)PHh)PHk2)PHh) + 3 perm.] , (30) 



where 



^'2(ki,k2)^^ + ^ 



F3(ki,k2,k3) = 



7 ki2-ki 



k~ k^ 



.. , , 2 (ki-k2)^ 

^■^'•■^^^^7^5^' 
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[F2(k2,k3) + G2(ki,k2)] + 



1 ^?,(ki ■ k2) 
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k^k\ 



[G2(k2,k3) + G2(k,,k2)], 



G2(k„k2).,.-|^.^^ 



, . 4 (ki ■k2)^ 



(31) 



Strictly speaking, the mode-coupling kernels, F„ and G„, are exact only for an Einstein de-Sitter model (Qmo = 1) and have 
a very weak dependence on the density parameters for models with ^ 1 [32]. Here we employ the Einstein de-Sitter 
approximation for simplicity. This impacts how we make the comparison to the response of the power spectrum to a background 
mode. 

If k, k' » q, each halo term of the trispectrum can be approximated as 
r"'(k, -k + qi2, k', -k' - q^) ^ r"*(;t, k, k', k'), 

T^^ik, -k + qi2, k', -k' - qi2) - T^^ik, -k, k', -k') + PHqi2)ll{k, k)ll{k', k'), 
rf^'(k, -k + qi2, k', -k' - qn) ^ T^g^k, -k, k', -k'), 
T^'\k, -k + qi2, k', -k' - q^) ^ T^\k, -k, k', -k') 

+4/^(^,A:)[/;(A:')]'PH?i2)f''(^')^'2(qi2,k')+4/^(^',^')[/J(^)]'i^'^(?i2)P'^(^)F2(-qi2,k), 
r^^k, -k + q,2, k', -k' - qi2) - T*''(k, -k, k', -k ) 

+Wl{k)ll(k')fpHqn)PHk)PHk') [F2(qi2, -k)F2(qi2,k') + F2(qi2, -k)F2(qi2, -k')] , (32) 
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where we have used approximations such as |k + qi | ^ ^. 

Inserting the halo model expressions for the matter trispectrum into Eq. (12), we find the non-Gaussian term can be broken up 

as 

^Cff + C'''', (33) 

where CjJ^ was given in Eq. (14) and is the standard non-Gaussian term involving only oppositely directed Fourier modes [13]. 
The last term Cf^'^ is the sum of the pieces involving P^iqn) in the T^^, T^'\ and T'*'' terms of Eq. (32). We shall see that the 
combined terms are exactly the super-sample covariance term of Eq. (19). 

To see this relationship let us begin with the remaining piece of the term. It can be simplified as 



J|k|£A-, Vk, Jiki'eA-^ Va, J (iny (Iny (iny 

- liiki, ki)ll(kj, kj)^ f ^\W(q)fp\q), (34) 

w v 

where we have used the variable change, qn q, in the 2nd line on the r.h.s., and we have use the identity (Eq. 21) for the 
window function. Note that the remaining integral over P^ is simply the variance of the linear density field convolved with the 
window function, Eq. (20). The HS V term can now be simply expressed as 

= ((T^flliki, kdl\{kj, kj). (35) 

We have labeled this term "HSV" for halo sample variance as it takes the form originally pointed out in Ref. [19] in their Eq. (18) 
[see also Section 3.4 in Ref. 22], although those works focused on the two-dimensional case (i.e. lensing power spectrum). 
The super-sample covariance consistency relation Eq. (19) is now manifest with Eq. (27) in the 1-halo dominated regime 

d\nP(k) ^ I\{k,k) 
ddb ~ I^iKk) 

In the halo model this consistency arises from compatibility of the bias with the mass function. The number density of rare halos 
responds to the background mode more strongly according to its bias and so the response function Eq. (36) involves the average 
bias of the halos contributing to the bin in k. 

Next let us consider the remaining P^ piece to the T"*'' term. This can be similarly simplified as 

op 1 r d\ r d\' rd\i d\o d\ ~ ~ - - , 

J|k|EA-, Vk, Jiki'eA, Vkj J {2ny (Iny (27Ty 

X [8[ll(k)ll{k')fpHk)PHk') [F2{q, -k)F2(q, k') + F2(q, -k)F2(q, -k')]} 

where we have used the identity (Eq. 21) for the window function and used the azimuthal-angle average of the perturbation 
theory kernel F2 as 



X 



't/i/12 5 2 17 



This type of term was called beat coupling (BC) in Ref. [16]. Our expression differs from their Eq. (94) only through the 
inclusion of the /J terms, which mediate the transition to the 1 halo regime. 

With these simplifications, the BC term is also exactly what we obtain from the consistency relation in the perturbative regime 
where the 2 halo term dominates the power spectrum. The background response, 

d\nP(k) _ 68 

ddb 21' ^ ^ 

can be derived by considering the growth in a separate universe with rescaled mean density in the Einstein-de Sitter approxima- 
tion that matches the assumptions for the F2 kernel (see Eq. 122 in Ref. [44]). 
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b 10" 



sphere-shaped geometry: V^={4TT/3)r 
cyhnder-shaped geometry: Vf^=-nr^x-nr 



cylinder— shaped geometry 
F™=(7rr^)xi with K„=l(GpcA)'^ fixed 




Volume: [Gpc/hf 



0.1 1 
radius: r [Gpc/h] 



FIG. 1. The variance of the linear mass density field convolved with the survey window function, (cr{^)^ for a ACDM model and z = 0. 
Left: variance as a function of survey volume for spherical Vw = (4;T/3)r^ and cylindrical windows Vw = nr^l; I = nr. We show the variance 
multiplied by the volume to illustrate that for a wide range in survey volumes the relative impact of super-sample variance and the other terms 
which scale as l/Vw remains the same within a factor of 2 due to the flatness of P^(k) around matter-radiation equality. Right: cylindrical 
windows of fixed volume Vw = Jtp-l = l(Gpc//i)\ Optimizing for both minimum variance and a non-elongated geometry (see text) yields 
Z = ;rr as the best choice for volumes that exceed the matter-radiation turnover. As shown in the left panel, this choice has a lower variance 
than the spherical window of the same volume. 



The final term is the piece of the T^'' term 

-/2i,/ /^j3„, j3 



(^HSV-BC 



J|k|E*:, Jiki'ekj Vkj J (27Ty (Iny (Ijry 

X {4[llik)fl^,ik', k')P\k)F2{q, k') + A[l\{k')]h\{k, k)P\k')F2(q, k)} 

- (c4)' ^^^Ulikdfl'iikj, kj)P\kd + a ^ ;)| . (40) 

This contribution takes the form of the cross term between HS V and BC. 
We can now see that the sum of all three terms 

^SSC _ ^HSV ^HSV-BC ^BC ^^i i s 



reproduces the general expression for the super-sample covariance of Eq. (19) with the total halo model response 

dlnP(k) ^ §[ll(k)]^PHk) + ll(k,k) 
ddh ~ [l}(k)]^PHk) + I^(k,k) ' 



(42) 



Thus the halo model obeys the consistency relation for the trispectrum of Eq. (18) and illustrates that super-sample covariance 
can be described by the response of the power spectrum to the background density. Beyond the halo model, it can be accurately 
cahbrated directly from simulations. 



B. ACDM examples 

To illustrate the power spectrum covariance for a ACDM model, we employ cosmological parameters that are consistent with 
the WMAP 9-year result [50]: Qmo/J^ = 0.136, Q.boh~ - 0.0226 and = 0.278 for the density parameters assuming a flat 
universe and n, - 0.972 and A, = 2.41 x 10"^ at A: = 0.002 Mpc ' for the primordial power spectrum parameters. We use the 
fitting formula in Ref. [51] to compute the transfer function for the model. Note erg = 0.83. As for the halo model ingredients, 
the halo mass function, the halo mass profile, and the halo bias, we used the same modes as in Ref. [52]. 

The fundamental building block for the super-sample covariance effect is the variance of the density field averaged over the 
survey window, (cr^)^. Our approach is not limited to simple window geometries since the power spectrum at high k should 



9 




k [h/Upc] k [ft,/Mpc] 

FIG. 2. The response function of the power spectrum to the super-survey mode, d\nP(k)/dSb for the halo model (Eq. 42). Left: z = 0- 
Thin solid and dot-dashed curves are the halo sample variance and the beat-coupling term respectively. The total power shows a plateau up to 
k ~ I /j/Mpc, having d In P/d6b - 3 for the amplitude. Right: z = 0, 1 and 2. At higher redshifts the plateau develops a small ridge due to the 
larger response of more rare halos. 



respond to the background density in the same way regardless of geometry. Here we consider spherical- or cylinder-shaped 
geometries as working examples for which the window functions are 

\W(k)\ =2— ■ ,,- Vw, Vw = {7Tr )l, (43) 

K±r k\\l/2 

where = k\ + k^^. The left panel of Fig. 1 shows how the variance scales with survey window volume compared to 1/Vw, 
which is the scaling of the standard covariance terms in TO (Eqs. 13 and 14). The main result is that for both geometries the 
scaling of (cr^)^ differs from the standard scaling only by a factor of 2 across the whole 4 orders of magnitude in volume. This 
is because the matter power spectrum P^{k) is relatively flat on the corresponding scales. It is only for volumes significantly 
greater than 1 (Gpc/Zj)^ that the super-sample variance effect declines relative to the other terms. 

For a cylinder window, we employ a specific shape given by / = nr. This is motivated by the study in the right panel of Fig. 1 
for the impact of the aspect ratio r// or r at a fixed volume 1 (Gpc/h)^. For a general aspect ratio and volume, the variance scales 
as k\\k\P^(k). Since the maximum k\\k]_ oc r^r^~ = const., minimizing the variance involves minimizing P^(k) at the maximum 
k. Increasing from small r or a "tube" geometry, k cc l/r and so the variance increases until the peak of the power spectrum 
is reached. It then declines until the minimum k is achieved when I ^ nr. The volume then becomes flattened into a "pill" 
geometry where k ~ 1// oc r^~. The variance increases until the peak of P^{k) is crossed in the opposite direction and then 
declines thereafter Since either the extreme "tube" or "pill" cases have undesirable effects of washing out structure in P^{k) via 
the convolution with the window and the smallest dimension cannot be in the nonlinear regime, this implies that for a cylindrical 
geometry the best shape for minimizing super-sample variance is / a nr. At this minimum point, the cylinder has a smaller 
variance by a factor of ~ 2 compared with a sphere of the same volume. 

The second building block of the super-sample covariance effect is the response of the nonlinear P{k) to a fractional change in 
the background density db. In Fig. 2, we study this response function dlnP/ddb as predicted by the halo model (Eq. 42). In the 
one and two halo regimes, the response function is given by the HSV and BC effects respectively. Summing up the two terms 
gives an plateau-like feature in the response function up to a certain wavenumber, k ~ I /i/Mpc for z = or somewhat smaller 
wavenumbers for the higher redshift. This implies that the super-sample variance can be absorbed by a multiplicative, constant 
factor in the power spectrum amplitude, up to the certain wavenumber In the halo model, higher wavenumber corresponds to 
smaller, less biased halos where the response decreases. Since halos of a given size are more rare at high redshift, there is an 
increase in response at higher redshift which creates a ridge at the transition from BC to HSV domination. 

We can now put these two pieces together to form the contribution to the power spectrum covariance from the super-sample 
covariance terms. Fig. 3 shows the non-Gaussian diagonal, or variance, term relative to the Gaussian expectation for a Vvy = 1 
(Gpc/Zi)^ spherical survey with bins of A log k - Q.l. As expected, the non-Gaussian contributions are larger than Gaussian 
for k Z O.l /i/Mpc where the linear to nonlinear transition occurs or equivalently where the 2-halo to 1-halo transition occurs. 



10 




k [/i/Mpc] 

no. 3. The diagonal term of power spectrum covariance as a function of /: in a ACDM model for a spherical survey volume of Vw = 
1 (Gpc//i)^, redshift z = Q, and /:-binning of A In A: = 0.1, respectively. Plotted here is the non-Gaussian covariance contributions relative to 
the Gaussian term. At A: i 0.1 /i/Mpc, the non-Gaussian errors start to dominate over the Gaussian term for the fc-binning. The bold solid 
curve is the halo model prediction for the super-sample covariance (SSC), which is broken down into the dominant components on the weakly 
nonlinear and fully nonlinear scales respectively: the halo sample variance (HSV; dashed) and the beat-coupling effect (BC; dot-dashed) (see 
Eq. 41). The red (light) solid curve is the standard trispectrum term (TO) which is subdominant or comparable throughout. 



Where the curve crosses unity specifically depends on the binning scheme since the Gaussian variance terms are suppressed as 
the number of independent ^-modes increases. 

For the halo model, the BC term dominates over other non-Gaussian contributions in the weakly nonlinear regime and the 
HSV term in the deeply nonlinear regime with a smooth transition in between, which is mediated by the cross term. The total 
SSC contribution therefore dominates or is comparable to the standard TO term everywhere. 

These results are of course limited by the accuracy of the halo model itself which does not directly suffice for future surveys. 
Weak lensing cosmology, which is the primary science diiver of planned imaging surveys, needs to use the power spectrum 
information up to A: ~ 1 /z/Mpc to attain the full potential [53]. Galaxy clustering based cosmology aims at using the information 
up to at least a few 0.1 Mpc/Zi to capture the baryon acoustic oscillations as well as to measure the redshift-space distortion effect 
[9, 54] . Thus an accurate simulation-based calibration of the background response d In P/d6b across this regime is important and 
wiU be presented in a separate paper. 



IV. DISCUSSION 

In this paper we have developed a simple, unified approach to describe the super-sample covariance of power spectrum 
estimation in a finite-volume survey. We show that the previously known effects of "beat coupling" and "halo sample variance" 
are both just limiting cases of the general response of the power spectrum to a change in the background density. 

Formally, all power spectrum covariance effects are described by the matter trispectrum due to the two-point nature of power 
spectrum estimators. The super-sample covariance term arises due to the convolution of modes across the Fourier width of 
the survey window or "squeezed" trispectrum configurations which connect pairs of closely separated short-wavelength modes 
through a long-wavelength or super-survey mode. We show that there exists a consistency relation between these squeezed 
trispectra and the response of the power spectrum to a change in the background density. This consistency relation also exposes 
why this term is a covariance: the power spectrum responds coherently across bins to a single unknown background density. 
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Our formulation is general and applicable to any survey geometry since the response to a background mode does not depend on 
the detailed geometry of the window. 

To make contact with the literature, we used the halo model trispectrum to illustrate these effects. We find that in the weakly 
nonlinear regime the response is exactly what is known as beat coupling. In the fully-nonlinear regime it is exactly what is 
known as halo sample variance. The joint effect is the dominant non-Gaussian covariance for a wide range of survey volumes 
and its accurate calibration will be important for high-precision cosmology using large-scale structure probes. Our description 
also exposes the fact that accurate calibration is straightforward. Since the effect is the response of the power spectrum to a 
change in the background density, it only requires running two simulations with different background parameters to calibrate at 
any given redshift. 

Our construction also exposes the possibility that power spectrum super-sample covariance need not be considered as an 
additional source of noise at all, but rather an additional signal of known shape but unknown amplitude given by the background 
density in the survey. By including it in the covariance, one pre-marginalizes the impact of the unknown amplitude. Alternately, 
one can fit for its amplitude given the template power spectrum response under a prior given by its expected variance in the 
survey window. This approach also directly exposes its impact on cosmological parameter estimation as different parameters 
will have different degeneracies with this background mode (see Ref. [27] for the similar discussion). It can also protect against 
unlikely realizations of the data by examining the dependence of results on the prior 

In principle, a joint fit could recover some information on super-survey modes from the observable sub-survey power spectrum 
albeit limited by degeneracies with global cosmological parameters. This opens up an interesting possibility to explore very 
large-wavelength fluctuations based on the background mode-coupling to short-wavelength observables. 
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Appendix A: Power spectrum covariance for projected density fields 

The covariance formalism developed in the main paper can be directly applied to any statistic that is derived from the matter 
power spectrum. In particular, many applications such as weak lensing involve a weighted 2D projection of the density field. 
We use the flat-sky approach developed in Refs. [15, 27] for the two point statistics of a single projected field but the formalism 
can easily be generalized to cross spectra of multiple fields [52, 55, 56] or all-sky statistics [57-60]. 

We define the projected density field as 



E(0) = j dx f{x)6(xe,x), (Al) 

where we have assumed a spatially flat universe with radial coordinate ;if(z) - J^~dz'/H{z) and fix) is the radial selection 
function. For the weak lensing field, f(x) is given by Eq. (4) in Ref. [27]. Taking into account the survey window function, the 
observed field is given as 

Uv(e) = ^(0)2(0), (A2) 

where "WiO) is the survey window function on the sky; 'W{6) = 1 if the pixel in the direction 6 on the sky is in the survey region 
or contains data, otherwise "WiO) - 0. 

Under the flat-sky approximation, the Fourier-transformed field becomes 

t^H)^ f^^V(l-l'M'). (A3) 

Similarly to Eq. (3), the power spectrum estimator for the projected field is defined as 

C(/,)=-?- r ^E^(l)£^(-l), (A4) 

J|l|e/, '^/, 
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where Q^y is the effective survey area defined as Q^v = jd^O "WiO), and Q/, = j^i^^, d^l - InUM when AZ/Z, « 1. 

In the limit that the angular mode of / is much greater than the width of the window function, the power spectrum estimator is 
unbiased in a sense that the ensemble average gives the underlying true power spectrum; 

(C(/,)) = C{U). (A5) 

As in the 3D case, we can derive the covariance of the 2D power spectrum including the effect of the window function: 

Cij = {CddCUj)) - CUdCdj) 

(A6) 



1 



(27r)2 



^ CUdSfj+T'^ihJj) 



where 



J|l|e/, i2/, J\\'\elj J (2w) 



(27r)24(qi234)7-(l + Qi, -1 + q2,l' + q3, -1' + q4). 



(A7) 



Using the Limber approximation, the angular power spectrum and trispectrum can be related to their 3D counterparts via 

C{1) 



j dxf{x)x-^P{^\x\ 



T-(li,l2,l3,l4) 



J dxfix). 



i;if-'>r(ki,k2,k3,k4;;rX 



where k, = hlx- 

The trispectrum consistency relation then implies 



(A8) 



(A9) 



where 



G 1 (27Tf K 



TO 1 C d-\ r dH' , , 

J|l|e/,. ii/i J|1'|E/, ii/,- 



-ssc 



— dx fix) X 
■'W ^ 



r dH , - , 



(AlO) 



where we have again used the Limber relation k, = li/x- With the halo model expressions for the response function from Eq. (42) 
these relations reproduce the results in the previous work, Eq. (18) in Ref. [19] and Eq. (17) in Ref. [27] for the HSV and BC 
contributions, respectively. In principle, background modes also change the k / relation through perturbations to the distance 
redshift relation. However such modes would have to be constant out to the maximum distance in the integral and are typically 
much smaller than the 61, considered here. 
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